NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Multi-Modal Contrastive Learning for Proteins by Combining Domain-Informed Views

Xu, Haotian; You, Yuning; Shen, Yang (March 2024, Machine Learning for Genomics Explorations workshop at ICLR 2024)

Proteins, often represented as multi-modal data of 1D sequences and 2D/3D structures, provide a motivating example for the communities of machine learning and computational biology to advance multi-modal representation learning. Protein language models over sequences and geometric deep learning over structures learn excellent single-modality representations for downstream tasks. It is thus desirable to fuse the single-modality models for better representation learning, but it remains an open question on how to fuse them effectively into multi-modal representation learning with a modest computational cost yet significant downstream performance gain. To answer the question, we propose to make use of separately pretrained single-modality models, integrate them in parallel connections, and continuously pretrain them end-to-end under the framework of multimodal contrastive learning. The technical challenge is to construct views for both intra- and inter-modality contrasts while addressing the heterogeneity of various modalities, particularly various levels of semantic robustness. We address the challenge by using domain knowledge of protein homology to inform the design of positive views, specifically protein classifications of families (based on similarities in sequences) and superfamilies (based on similarities in structures). We also assess the use of such views compared to, together with, and composed to other positive views such as identity and cropping. Extensive experiments on enzyme classification and protein function prediction benchmarks demonstrate the potential of domain-informed view construction and combination in multi-modal contrastive learning
more » « less
Full Text Available
EMONA: Event-level Moral Opinions in News Articles

https://doi.org/10.18653/v1/2024.naacl-long.293

Lei, Yuanyuan; Miah, Md_Messal Monem; Qamar, Ayesha; Reddy, Sai Ramana; Tong, Jonathan; Xu, Haotian; Huang, Ruihong (June 2024, Association for Computational Linguistics)

Full Text Available
Localising change points in piecewise polynomials of general degrees

https://doi.org/10.1214/21-EJS1963

Yu, Yi; Chatterjee, Sabyasachi; Xu, Haotian (January 2022, Electronic Journal of Statistics)

Full Text Available
Objective Detection of Eloquent Axonal Pathways to Minimize Postoperative Deficits in Pediatric Epilepsy Surgery Using Diffusion Tractography and Convolutional Neural Networks

https://doi.org/10.1109/TMI.2019.2902073

Xu, Haotian; Dong, Ming; Lee, Min-Hee; O'Hara, Nolan; Asano, Eishi; Jeong, Jeong-Won (August 2019, IEEE Transactions on Medical Imaging)

Full Text Available
Directionally Convolutional Networks for 3D Shape Segmentation

https://doi.org/10.1109/ICCV.2017.294

Xu, Haotian; Dong, Ming; Zhong, Zichun (October 2017, 2017 IEEE International Conference on Computer Vision (ICCV))

Full Text Available
Directionally Convolutional Networks for 3D Shape Segmentation

Xu, Haotian; Dong, Ming; Zhong, Zichun (January 2017, IEEE International Conference on Computer Vision (ICCV))

Previous approaches on 3D shape segmentation mostly rely on heuristic processing and hand-tuned geometric descriptors. In this paper, we propose a novel 3D shape representation learning approach, Directionally Convolutional Network (DCN), to solve the shape segmentation problem. DCN extends convolution operations from images to the surface mesh of 3D shapes. With DCN, we learn effective shape representations from raw geometric features, i.e., face normals and distances, to achieve robust segmentation. More specifically, a two-stream segmentation framework is proposed: one stream is made up by the proposed DCN with the face normals as the input, and the other stream is implemented by a neural network with the face distance histogram as the input. The learned shape representations from the two streams are fused by an element-wise product. Finally, Conditional Random Field (CRF) is applied to optimize the segmentation. Through extensive experiments conducted on benchmark datasets, we demonstrate that our approach outperforms the current state-of-the-arts (both classic and deep learning-based) on a large variety of 3D shapes.
more » « less
Full Text Available

Search for: All records